Hindi vowel classification using GFCC and formant analysis in sensor mismatch condition

نویسندگان

  • ASTIK BISWAS
  • ANIRBAN BHOWMICK
  • MAHESH CHANDRA
چکیده

In the presence of noise and sensor mismatch condition performance of a conventional automatic Hindi speech recognizer starts to degrade, while we human being are able to segregate, focus and recognize the target speech. In this paper, we have used auditory based feature extraction procedure Gammatone frequency cepstral coefficient (GFCC) for Hindi phoneme classification. To distinguish vowels from each other, we have analyzed frequency response curves of each vowel. Here we propose a new feature extraction technique by taking first three formant frequencies of each vowel along with their cepstral features to increase the phoneme classification performance in noisy condition. The classification performance achieved by the proposed features is compared with the standard MFCC and GFCC based features using a continuous density hidden Markov model (CDHMM) with a mixture of Gaussian distributions. To evaluate robustness of these features in noisy environment, the NOISEX database is used to add different types of noise into vowels in the range of 0 dB to 20 dB. Furthermore robustness of new set of feature has been evaluated in the sensor mismatch condition. The classification results show that under noisy background as well as the sensor mismatch condition the proposed technique achieves a better performance over standard cepstral based features. Key-Words: MFCC, GFCC, Formant, HMM, Phoneme Classification.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study and Comparison of Formant Characteristics of Persian Vowels in 4-7-year-old Children Using Cochlear Implants and Those Using Hearing Aids

Background and Objective: One of the most important physical properties of vowels is their formant structure. One of the most obvious speech errors in hearing-impaired children is vowel errors. The present study aimed to determine and compare the formant structure of Persian vowels in deaf and cochlear implant children in the age range of 4-7 years. Materials and Methods: This descriptive-anal...

متن کامل

Mismatch negativity at Fz in response to within-category changes of the vowel /i/

The amplitude of the mismatch negativity response for acoustic within-category deviations in speech stimuli was investigated by presenting participants with different exemplars of the vowel /i/ in an odd-ball paradigm. The deviants differed from the standard either in terms of fundamental frequency, the first formant, or the second formant. Changes in fundamental frequency are generally more sa...

متن کامل

First formant difference for /i/ and /u/: A cross-linguistic study and an explanation

When the acoustic space that can be reached by a realistic articulatory model (Mermelstein, 1973) is explored systematically, an interesting asymmetry between front and back vowels is found (de Boer, 2009). The first formant of the highest possible back vowel [u] is lower than the first formant of the highest possible front vowel [i]. In order to test this observation in real languages, the val...

متن کامل

Formant Structure of Vowels Produced By Hindi Esophageal Speakers: A Comparative Study

Aim: To investigate the characteristics of vocal tract resonance in Hindi Esophageal Speakers. Methodology: Five normal Hindi speakers and five Hindi esophageal speakers participated in this study. They were asked to produce the three corner vowels /a/, /i/, and /u/ in the three different conditions like nonsustained, sustained and word level. The average first three formant frequencies F1, F2,...

متن کامل

بررسی اثر فیدبک شنوائی در تولید گفتار بعد از عمل کوکلئار ایمپلنت

The main goal of this study is to determine the auditory feedback effects in improvement of speech production process in prelingual totally deaf children who used cochlear implant prosthesis. For this reason, we recorded speech of four prelingual cochlear implant children pre and post of operation. Then we extract some static features of vowels-such as fundamental frequency, formant frequencies...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014